@yannicks1 yannicks1 commented Oct 1, 2025

Supporting granite4 2b & 3b (attention only).

fms support has been merged into main here: foundation-model-stack/foundation-model-stack#472

Implemented in a backward-compatible way, so that we can get it into vllm-spyre main without relying on an fms version released after the above PR.
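The description does not say how backward compatibility is achieved. As a purely illustrative sketch, one common pattern is to probe the installed library for a feature at runtime and fall back when it is missing. Every name below (`supports_architecture`, `registered_architectures`, `old_lib`, `new_lib`, `"new_arch"`) is a hypothetical stand-in, not the fms or vllm-spyre API.

```python
from types import SimpleNamespace


def supports_architecture(lib, name: str) -> bool:
    """Return True if `lib` advertises support for architecture `name`.

    `registered_architectures` is a hypothetical attribute used only for
    illustration; older library versions are assumed not to define it.
    """
    registered = getattr(lib, "registered_architectures", None)
    if registered is None:
        # Older version: the attribute is absent, so treat as unsupported.
        return False
    return name in registered


# Hypothetical stand-ins for an older and a newer library version.
old_lib = SimpleNamespace()
new_lib = SimpleNamespace(registered_architectures={"base_arch", "new_arch"})

for lib in (old_lib, new_lib):
    if supports_architecture(lib, "new_arch"):
        print("using native support")
    else:
        print("using fallback path")
```

Probing with `getattr` rather than comparing version strings keeps the check valid for development builds whose version number may not reflect the merged feature.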

github-actions bot commented Oct 1, 2025

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Yannick Schnider <[email protected]>
Signed-off-by: Yannick Schnider <[email protected]>
self.kv_cache_specs['head_dim'] = self.config.hidden_size // \
    self.config.num_attention_heads

# *** ALERT *** Granite 2b hack for AIU Compiler
Collaborator

Do we know if this is a temporary hack? If so, perhaps we can add a compatibility test.

Collaborator Author

We will see how they implement it properly in fms. I am just mirroring their strategy here... (very much a work in progress)
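For reference, the `head_dim` line in the diff context of this thread derives the per-head dimension by integer division of the hidden size by the number of attention heads. A minimal, self-contained sketch of that arithmetic follows. `ToyConfig` and its values are hypothetical and are not Granite's real sizes, and the divisibility check is an addition for illustration that does not appear in the diff.

```python
from dataclasses import dataclass


@dataclass
class ToyConfig:
    # Hypothetical values for illustration only; not Granite's real sizes.
    hidden_size: int = 2048
    num_attention_heads: int = 32


def head_dim(config: ToyConfig) -> int:
    """Per-head dimension: hidden_size // num_attention_heads."""
    if config.hidden_size % config.num_attention_heads != 0:
        # Added guard (not in the diff): floor division would silently truncate.
        raise ValueError("hidden_size must be divisible by num_attention_heads")
    return config.hidden_size // config.num_attention_heads


kv_cache_specs = {"head_dim": head_dim(ToyConfig())}
print(kv_cache_specs)  # {'head_dim': 64}
```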

@yannicks1
Collaborator Author

bot:test

@yannicks1 yannicks1 changed the title from "[WIP] Granite 2b & 3b support" to "Granite4 2b & 3b support" Oct 15, 2025
@yannicks1
Collaborator Author

bot:test

@yannicks1 yannicks1 marked this pull request as ready for review October 15, 2025 12:31

@maxdebayser maxdebayser left a comment

LGTM. I suppose we can add the model to the known supported configurations once we have PELE test coverage.

cc: @ckadner , @joerunde , @jfialho

@yannicks1 yannicks1 merged commit de0e3d2 into main Oct 16, 2025
20 checks passed
@yannicks1 yannicks1 deleted the ysc-support-granite-2b-3b branch October 16, 2025 19:36